Multifont Ottoman Character Recognition

نویسندگان

  • Ali ÖZTÜRK
  • Yüksel ÖZBAY
چکیده

Ottoman characters from three different fonts are used character recognition problem, broadly speaking, is transferring a page that contain symbols to the computer and matching these symbols with previously known or recognized symbols after extraction the features of these symbols via appropriate preprocessing methods. Because of silent features of the characters, implementing an Ottoman character recognition system is a difficult work. Different researchers have done lots of works for years to develop systems that would recognize Latin characters. Although almost one million people use Ottoman characters, great deal of whom has different native languages, the number of studies on this field is insufficient. In this study 28 different machine-printed to train the Artificial Neural Network and a %95 classification accuracy for the characters in these fonts and a %70 classification accuracy for a different font has been found.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multifont Classification using Typographical Attributes

This paper introduces a multifont classification scheme to help recognition of multifont and multisize characters. It uses typographical attributes such as ascenders, descenders and serifs obtained from a word image. The attributes are used as an input to a neural network classifier to produce the multifont classification results. It can classify 7 commonly used fonts for all point sizes from 7...

متن کامل

Word-level recognition of multifont Arabic text using a feature vector matching approach

Many text recognition systems recognize text imagery at the character level and assemble words from the recognized characters. An alternative approach is to recognize text imagery at the word level, without analyzing individual characters. This approach avoids the problem of individual character segmentation, and can overcome local errors in character recognition. A word-level recognition syste...

متن کامل

Word Recognition With Multi-Level Contextual Knowledge

A word recognition algorithm is proposed that integrates character recognition with word shape analysis. The algorithm consists of a set of serial filters and parallel classifiers, and the decisions are combined to generate a consensus ranking of the input lexicon. Experimental results with multifont machine-printed word images are discussed.

متن کامل

An OCR System for Printed Documents

This paper describes the general structure of a full automated document analysis system for printed documents. The system is based on a character preclassification stage which reduces the number of patterns to recognize and introduces a new contextual processing. This specific approach for multifont printed documents reading is based on pattern character redundancies. With the study of prototyp...

متن کامل

ntegrated segmentation and recognition of onnected Ottoman script

smet Zeki Yalniz smail Sengor Altingovde ğur Güdükbay zgür Ulusoy ilkent University epartment of Computer Engineering ilkent, Ankara, 06800 urkey -mail: [email protected] Abstract. We propose a novel context-sensitive segmentation and recognition method for connected letters in Ottoman script. This method first extracts a set of segments from a connected script and determines the candi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000